Memory Fault Tolerance Software Mechanisms: Design and Configuration Support through SWN Models

نویسندگان

  • Paolo Ballarini
  • Lorenzo Capra
  • Giuliana Franceschinis
  • Massimiliano De Pierro
چکیده

This paper presents a case study of a software fault tolerance mechanisms, the distributed memory, designed and implemented within the european projects TIRAN and DEPAUDE, and currently under study within the Italian project ISIDE. The studied mechanisms are part of a complete framework of general purpose software fault tolerance mechanisms. In this paper we show a method for the compositional construction of models of the DM and of the environment in which it operates, expressed in the Stochastic Well Formed Nets (SWN) formalism. Different versions of submodels, at different detail level are presented and compared using some behaviour inheritance notions taken from the literature.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Validation and Evaluation of a Software Solution for Fault Tolerant Distributed Synchronization

This paper presents a case study on the combined use of different tools and techniques for the validation and evaluation, from the early stages of the design, of a fault tolerant software mechanism named distributed synchronization The mechanism has been specified using UML state charts and sequence diagrams. A number of Stochastic Well-formed Nets (SWN) models have been derived from the specif...

متن کامل

A Survey on Fault Tolerance in Work flow Management and Scheduling

Fault Tolerance is a configuration that prevent a computer or network device from failing in the event of unexpected problem or error such as hardware failure, link failure, unauthorized access, variations in the configuration of different systems and system running out of memory or disk space. The integration of fault tolerance measures with scheduling gains much importance. Workflow managemen...

متن کامل

Fault Tolerant Memory Design for HW/SW Co-Reliability in Massively Parallel Computing Systems

A highly dependable embedded fault-tolerant memory architecture for high performance massively parallel computing applications and its dependability assurance techniques are proposed and discussed in this paper. The proposed fault tolerant memory provides two distinctive repair mechanisms: the permanent laser redundancy reconfiguration during the wafer probe stage in the factory to enhance its ...

متن کامل

Automatic Generation of Fault-Tolerant CORBA-Services

The Common Object Request Broker Architecture (CORBA) is the most successful representative of an object-based distributed computing architecture. Although CORBA simplifies the implementation of complex, distributed systems significantly, the support of techniques for reliable, fault-tolerant software, such as group communication protocols or replication is very limited in the state-of-the-art ...

متن کامل

Fast Checkpoint/Recovery to Support Kilo-Instruction Speculation and Hardware Fault Tolerance

The increased relative cost of accessing memory is encouraging processor designers to explore deeper uniprocessor speculation (e.g., with branch and value prediction) and consider multiprocessor speculation (e.g., on coherence message types and values). While some mechanisms have been proposed to support deep speculation using speculative multithreading, current mechanisms for conventional proc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003